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Abstract 

The O(d) Synchronization problem consists of estimating a set of unknown orthogonal transformations 
Oi from noisy measurements of a subset of the pairwise ratios OiOJ 1 . We formulate and prove a Cheeger- 
type inequality that relates a measure of how well it is possible to solve the O(d) synchronization problem 
with the spectra of an operator, the graph Connection Laplacian. We also show how this inequality 
provides a worst case performance guarantee for a spectral method to solve this problem. 

Keywords: O(d) Synchronization, Cheeger Inequality, Graph Connection Laplacian, Vector Diffusion Maps. 



1 Introduction 

The Synchronization problem over a group Q on a graph G = (V, E) consists of estimating a group potential 
g : V — > Q on the vertices given, for each edge (i, j) <E E, a measurement of the offset = gigj 1 ■ If the 
measurements are noiseless and the graph is connected then the problem can be easily solved by considering 
a spanning tree in G and sequentially determining the group potential values by traveling along the tree and 
sequentially multiplying the offsets. The potential obtained is uniquely determined up to an overall group 
translation, that corresponds to fixing its value on one node. 

The case Q = 0(d), the group of d x d orthogonal matrices (rotations, reflections and compositions of both 
on M. d ), is of particular interest in several applications. For example, when d = 1, i.e. Q = 0(1) = Z/2Z, 
the synchronization solution can be used to determine whether a manifold is orientable [14] . An interesting 
aspect of the 0(1) synchronization problem is that it is a generalization of the max-cut problem [17] as 
we will see later. The 0{d) synchronization also plays a major role in a certain algorithm for the sensor 
network localization problem [TJ. The, very similar, synchronization in 50(<ij3 also has several applications; 
for d = 3, it can be used, e. g., for global alignment of 3-D scans [TB] and the SO(2) problem plays a major 
role in new developments on the Cryo-electron microscopy problem (see [12U15) ). Several more applications 
for SO(2) synchronization are presented in and references therein. 

When there is noise in the offset measurements, the spanning tree method will suffer from accumulation of 
errors. Thus, there is the need for methods that use the redundancy in the information given and globally 
estimate all values simultaneously. Such a method is proposed and analyzed in for Q = SO{2). This 
method can be adapted to deal with other groups of orthogonal matrices, as we will see later. In a nutshell, 
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the method consists of constructing an operator (a matrix) whose eigenvectors can be used to estimate the 
group potential. In [11], the resilience to a certain type of random noise if analyzed. There is, however, no 
worst case guarantee. 

Nevertheless, some other spectral methods have worst case performance guarantees. In fact, Chceger's 
Inequality for graphs [TJ [5] provides such a guarantee for spectral clustering. It relates the spectra of the 
graph Laplacian with bottlenecks in the graph. Informally, it says that clusters are the only obstacles to the 
rapid mixing of a random walk on a graph. In the present paper we provide a similar guarantee for 0(d) 
Synchronization via a Chceger-like inequality for the graph Connection Laplacian. 

Recently, a notion of vector diffusion on graphs has been introduced [13| . One of its purposes is to deal 
with data for which there is a natural rotation group action^. Very roughly, the diffusion process, instead of 
moving point masses between vertices, moves vectors and rotates them depending on the relative rotation 
between the data points. If there is a cycle with an inconsistent set of relative rotations then cancelations 
will take place in the diffusion process and there will be loss of mass. This loss of mass can be measured by 
the spectra of the graph Connection Laplacian [13]. In the present paper we show, for Q = 0(d), a type of 
a Cheeger Inequality (Theorem [TO]) relating the spectra of this operator with a measure of inconsistency on 
the offsets over cycles of the graph, showing that they are, in fact, responsible for the mass loss of the vector 
diffusion process. 

The inequality shown in the present paper provides a worst case performance guarantee for a spectral 
method (Algorithm I16|) to solve the 0(d) synchronization problem. It is worth noting that, in a different 
setup, Trevisan [T7] showed a particular case of this inequality, when the group is 0(1) = Z/2Z and all 
the offsets are —1. In that case the problem is equivalent to the Max-Cut problem. Therefore, this O(l) 
inequality gives a performance guarantee for a spectral method to solve Max-Cut |17j . 

Throughout the paper we use the notation [n] to refer to {1, . . . , n}. Also, we make use of several matrix and 
vector notations. Given a matrix A we denote by ||^4||_f its Frobenius norm. If A is symmetric we denote 
by Xi(A), \2(A), ... its eigenvalues in increasing order. Assuming further that A is positive semi-definite we 
define the A-inner product of vectors x and y as (x, y) a = x 1 Ay (and say that two vectors are A-orthogonal 
if it is zero). Also, we define the A-norm of x as ||x||,t = yj (x, x) a- Given x e (M d )" we denote by xt the 
i-th d x 1 block of x (that will correspond to the value of x on the vertex i). Also, for t > 0, we define x l as 



Finally, x denotes x , with the small nuance that if x% = for some i then Xi can simply be any unit vector 



The rest of this section includes mathematical preliminaries that are needed in later sections. Section 2 
consists of the solution to a subproblem and Section 3 contains both the formulation and proof of our main 
result. We discuss an Algorithm for 0(d) synchronization in Section 4 and show a few tightness results in 
Section 5. We end with some concluding remarks and a few open problems in Section 6. 

1.1 Preliminaries 

Let G = (V, E) be an undirected weighted graph with n vertices. Let Wo be the weighted adjacency matrix 
of G and Do be the degree matrix, a diagonal matrix with diagonal elements (Dq) u = Y^ij=i w ij- The matrix 
D^Wq is the transition probability matrix of the random walk in G, whose transition probabilities are 
proportional to the edge weights. The graph Laplacian of G is defined as Lq = Do — Wq. It is also useful to 

1/2 1/2 1/2 1/2 

consider the normalized graph Laplacian Cq = D Q LqD q = I — D a WqD q . Both Lq and Cq are 

2 For example, a data set of hand written digits where one wants to determine whether two written digits are the same up 
to rotation. 
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symmetric positive semi-definite matrices. The eigenvalues of Cq encode important information about the 
random walk. In fact, the second smallest eigenvalue!! is a good measure of how well the random walk mixes. 
More specifically, the smaller A2(£o), the slower the convergence to the limiting stationary distribution. 

It is clear that clusters, i.e. sets that are very internally connected but poorly connected to the rest of the 
graph, will constitute obstacles to rapid mixing of the random walk, since the probability mass might be 
trapped inside such a set for a while. Chcegcr's Inequality shows that, in some sense, these sets are the only 
obstacles to rapid mixing (several different proofs for this inequality can be found in [5]). 

Theorem 1 ([HUD- Let G = {V,E) be a graph and Cq its normalized graph Laplacian. Then 

1 , Cut ( 5 ) 



9 -A 2 (£ ) < min . < y/2X 2 (C ), 

2 scv mm{vol(D), vol(£> c )} 

where cut (S) = J2ies j^s Wi i o-nd vo\(S) = Y^ies d%> with di = Ylje.v w vi ^ e degree of the i-th vertex. 

If, associated to each edge £ E, there is an orthogonal transformation^ p^ £ 0(d), one might be 

interested in considering a random walk that takes the transformations into account. One way of doing this 
is by defining a random walk that, instead of moving point masses, moves a vector from vertex to vertex and 
transforms it via the orthogonal transformation associated with the edge. A Laplacian for this diffusion, the 
graph connection Laplacian, was introduced in |13j . The construction goes as follows: define the symmetric 
matrix W\ £ R dnxdn ; such that the (i,j)-th dx d block is given by (Wi)ij = WijPij, where Wy is the weight 
of the edge Also, let D\ £ l' i ' ix ' 1 ' 1 , be a diagonal matrix such that (D\)u = dil^xd- The graph 

Connection Laplacian L\ is defined as L\ = D\ — W\, and the normalized graph Connection Laplacian as 

d =I-D~ 1/2 W 1 D~ 1/2 . 



One natural motivation for this random walk is the setting in which the vertices of the graph represent a 
data set Z = {zi, . . . , z„} with an action of 0(d) defined on it. Such an example is a set of images and their 
rotations and reflections (corresponding to d = 2). On this type of data set, one is interested in comparing 
two data points when they are optimally aligned. Thus, we set to be the element in 0(d) that makes Zj 
as close as possible (in some meaningful distance) to z.;, meaning 

Pij = argmin pe0((i) \\ Zi - p Zj \\, 

and set the affinity between the data points accordingly. 

In this case, the vector random walk (or diffusion) can be thought of as a random walk on the orbits of the 
group action. This kind of walk might be more meaningful because it attempts to quotient out the group 
action. If all the data points lie in a single orbit then there exists a group potential O : V — > 0(d) such that, 
V(i,j) £ E, = OiOj 1 . Finding such a group potential is the objective of the Synchronization problem. 
In this case the walk should have a stationary distribution, corresponding to the null space of the graph 
Connection Laplacian. In fact, by computing the Rayleigh quotient 

X T L 1 X _ l J2{i,j)£E w ij (iMl 2 + INI 2 -ZxfPijXj) _ 1 J2(i,j)eE w ij IN ~ Pij%jf m 

x T Dix 2 J2iev di 

one notes that each column of the dn x d matrix representation of the potential O is an eigenvector (in 
the null space) of the graph Connection Laplacian. This observation motivates the spectral method for 
Synchronization. As it turns out, this method tends to be very robust to outliers [TT], meaning that even if 
several measurements are extremely corrupted the potential recovered is likely to still be very close to the 
original one. 

3 Wc note that the smallest eigenvalue is always 0. 

4 Recall that 0(d) is the group of orthogonal transformations in M d . 
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In fact, we will be interested in cases for which a rotation group potential may not exist (due to, e.g., errors 
in measurements) and the objective will be to estimate the "best possible" potential. Inspired by ([T]) we 
define a penalty function for the synchronization problem as, given a potential O, 

As Wij = when ^ E, we can sum over all pairs of vertices without loss of generality. Since our 

objective is to minimize this function, we define the 0(d) frustration £ 2 constant of G as 

vq = min v{0). 

O-.V^O(d) 

It is unclear which penalty function one should consider. Considering the sum of the squares of the Frobcnius 
norms of the incompatibilities (the £2 penalty function) is more close in spirit to ([1]) and thus more similar to 
what the spectral method will try to minimize, which suggests that this penalty function is easier to analyze. 
On the other hand, considering the sum of the Frobenius norms of the incompatibilities (in some sense, an 
£\ penalty function) will induce sparsity, meaning that it will favor candidate solutions for which some edges 
are perfectly correct even if it has some edges with a large error. This might make more sense in some noise 
models on the edge measurements. In fact, the noise model analyzed in [TT| consists of a few randomly 
chosen edges having a measurement that is randomly drawn with respect to the uniform distribution in the 
space of possible measurements (in our case O(d), in [TT] SO(2) ), therefore the original rotation potential 
will perfectly agree with some edges and have a large error on others. This suggests that considering the £\ 
penalty function may be more suitable to recover the original rotation potential. For that reason we also 
analyze it, where we formally define the £\ penalty function as 



and the 0{d) frustration £\ constant of G as 



$ G = min d{0). 

O-.V^O(d) 



2 The S d - 1 localizat ion problem 

Let us consider a simpler version of the problem. Suppose we just want to estimate one column of the 
synchronization solution. We can state it as estimating a function v : V — > S^ 1 such that, for (i,j) G E, 
i>i = Pij v j- This can be thought of as a localization problem on S d_1 , the unit sphere in K d . 

Again, such a function may not exist in general. Similarly to the penalty function for O(d) synchronization 
we define an £\ penalty function (,(v) and an £2 version rj{v) as, respectively, 

v iJ2ijWij\\vi- pijVjW lEij^ijlk -PijVjW 2 

We then define (g, the S d_1 frustration £\ constant of G, as 

Cg = min C(u), 

and similarly, we define tjq, the S d_1 frustration £2 constant of G, as 

tig = min w(v). 

The spectral approach to this problem is motivated by the fact that Ai(£i) = min — = = min ij(x). 

xeR^ x 1 Dix x -.v^m d 
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2.1 The partial frustration constants 



Our objective is to relate Ai(£i) with (g and ?7g- Consider, however, the following example: a graph 
consisting of two disjoint components, one whose pij measurements are perfectly compatible and another 
one on which they are not. Its graph Connection Laplacian would have a non-zero vector in its null space, 
corresponding to just synchronizing the compatible component and being zero on the other (thus Ai (£1) = 0) . 
On the other hand, the constraint that v has to take values on S rf_1 , will enforce it to try to synchronize 
the incompatible part and will cause Cg an d r]G to be bounded away from zero. This example suggests a 
relaxation of the frustration constant to allow vertices to not be labeled (labeled with 0). We thus define 
( G , the partial S d_1 frustration £\ constant of G, as 

Cg = mm C(v), (2) 

v.V^S d - 1 u{0} 

and its £2 counterpart as n G = mi'n V :v^-s d - 1 u{o} n(v). 
We will show the following. 

Theorem 2. Let G = (V, E) be a graph. Given a function p : E — > 0(d), let £ G and n G be, respectively, the 
partial S rf_1 frustration l\ and £2 constants of G and Ai(£i) the smallest eigenvalue of the normalized graph 
Connection Laplacian. Then 



<C G < V| A 1(A) and Ax(A) < V* G < 2j|Ai(A), (3) 



Furthermore, if d = 1, the stronger inequalities hold, (q < y^2Xi(Ci) and n* G < 2^2Ai(£i). 



An O(l) version of this inequality was shown in |17j . when p is the constant function equal to —1, in the 
context of the Max-cut problem. 

It is trivial to check that, for any x £ M. dn , and t > 0, ^(x*) < C( a;t )- This observation, together with the 
fact that the eigenvalue problem is a relaxation of r/ G , shows both left inequalities and the fact that the first 
right inequality implies the second in The first right inequality is a direct application of the following 
Lemma which is obtained by an adaptation of an argument in [17] . 

Lemma 3. Given x <G R dn there exists t > such that 




Moreover, if d = 1 the right-hand side can be replaced by ^J2n(x). 



In order to prove this Lemma we need the following inequality. 

Proposition 4. For any y and z unit vectors in K d , the following holds for any a > 1, 

\\y-z\\+a 2 -l<^-\\y-az\\(l + a). 



Proof. Let t = \\y — z\\, which implies that < t < 2. Since y and z are unit vectors it is straightforward to 



check that \\y — az\\ =,/l + a 2 - 2a (l — \t 2 \ Thus, it suffices to show 




1 - \* ; 



5 



for all < t < 2 and a > 1. Since both sides of ((4]) are positive, it is enough to show the inequality with 
both sides squared. Squaring and rearranging yields, 

t 2 + 2ta 2 -2t+ {a 2 l) 2 < | ((a 2 - l) 2 + at 2 (l + a) 2 ) . 

This is equivalent to the non- negativity, in the interval [0, 2], of a certain quadratic function of t: 

(^a(l + a) 2 -ljt 2 - {2a 2 - 2) t + 1 {a 2 - if > 0. 

Since, for a > 1, 

(2a 2 - 2) 2 - 4 (Ja(l + a) 2 - lj \ (a 2 - if = (a 2 - if (4 - |a(l + a) 2 + < 0, 
the quadratic is always non- negative and thus non- negative in [0, 2]. □ 



Proof, [of Lemma [3] Let us suppose, without loss of generality, that x is normalized so that maxj ||xj|| = 
1. We will use a probabilistic argument. Let us consider the random variable t drawn uniformly from 
[0,1] and recall that x l is defined by x\ = if ||:Ei|| 2 > t or x \ — if ||xi|| 2 < t. We will show that 

\ E ^'e^ t^llz^l 3 ^ ^ — \J~\^^)i w hich implies that at least one of the realizations of t must satisfy the 
inequality, and proves the Lemma. 

We start by showing that, for each edge 



m4-Pii*u^% x i-f* x d(\\*i\\ + \\*i\\) 

Without loss of generality we can consider pij = I and ||xj|| < \\xi\\ and get, 



E\\x\- 4|| = |NI 



(IK 



I 2 -Ilx,ll 2 ) 



Thus, it suffices to show 



■>'i 



+ (INI 2 -INI 2 ) ^11 (IWI + 



(5) 



which is a consequence of Proposition U for y = -r^u, z = tt^tt and a = 1^-41. Now, using JSl), the linearity 

\\ X 3 II \\ X i II II^J II 

of expectation, and the Cauchy-Schwartz inequality we have 



V5 



< 2 



t^ij 1 1 Pij •Ej I 



Since ^« w. 



2r )(x)T,i d i\\ x i\\ 2 and 



Xif + lkdl 2 ) 



45>imi 2 , 



we have 



E^^llxf-^H < y ^^dil.f = I^E^diH^I =2W / |^E^ ( i i ||x*| 
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which completes the proof. When d = 1 the sharper result can be obtained by noting that ((5j) holds even 
without the factor. □ 

It is worth noting that the proof of Lemma [3] provides an algorithm for finding v : V — > U {0} such 

5 



that < y|Ai(£i) < y§Cg- ^ consists of setting x as the minimizer of rj(x) and finding t such 

that C,{x l ) < j\i (Ci) < J |Cg- This algorithm is computationally tractable because minimizing rj(-) is 

equivalent to solving an eigenvalue problem and, instead of testing x l for every t > 0, one can limit it to the 
values t for which there exists i € [n] such that \\xi\\ — t. Trevisan [17j iteratively performs this procedure, 
in the subgraph composed of the vertices left unlabeled by the previous iteration, in order to label the entire 
graph. 



2.2 The frustration constants for localization in E> d 1 

In this section we obtain bounds for (^q and t\q. The intuition given to justify the relaxation to partial 
frustration constants was based on the possible poor connectivity of the graph (small spectral gap). In 
this section we show that poor connectivity, as measured by a small spectral gap in the normalized graph 
Laplacian, is the only condition under which one can have large discrepancy between the frustration constants 
and the spectra of the graph Connection Laplacian. We will show that, as long as the spectral gap is bounded 
away from zero, one can in fact control the full frustration constants. The formal result is presented in the 
following Theorem. 

Theorem 5. Let G = (V,E) be a graph. Given a function p : E — > O(d), let £g and t\q be, respectively, the 
§> d 1 frustration l\ and £2 constants of G , Ai(£i) the smallest eigenvalue of the normalized graph Connection 
Laplacian and A2(£o) the second smallest eigenvalue of the normalized graph Laplacian. Then, 

A 1 (A)< f?G <44^4, 

-Mm); 

and IMA) < Cg < Hylg}. 

We start by stating and proving a few results that are going to be important to both the proof of Theorem 
[5] and the forthcoming sections. 

Lemma 6. Given x G R dn , there exists a x > 0, such that r x = x — a x x satisfies \\r x \\1 1 < >^rg~) ll^lli)! • 

Proof. Let us define n x S Ml™ by (n x )i = ||xi||. We now set a x = argmin Q \\n x — al||u . A simple calculation 
reveals that this gives 

_ l T D n x 
ax ~ 1 T D 1 ' 

Since n x is a non-negative vector, a x is non-negative as well. Let us also define u x 6 M™ so that (r x )i — 
(u x )iXi. This implies that u x — n x — \tp"x° !• Thus, 



uIL Ux = n T x L Q n x = ^ ^ WijGMI - Nil) 2 < \ ^w lJ \\x l - p lJ x j \\ 2 = ^Nl^ 

ij ij 

Since ulD l = 0, we have > A 2 (£ )- This shows that H^H^ = H^H^ < ^^y^l^IlD,- □ 

The previous Lemma gives an upper bound on how "unbalanced" a good candidate for synchronization 
can be. Intuitively this should imply that, for most vertices i, Xi has roughly the same norm. What follows 
clarifies this thought. 



7 



Definition 7. Given x £ M. dn , normalized so that \\x\\ 2 - )i = vol(G), and a positive number 8, we define the 
Ill-balanced vertex subset of the graph G as Tb x (5) = {i £ V : \\\xi\\ — 1| > 6}. 

The volume of lb x (8) is controlled by the following Lemma. 
Lemma 8. Let x £ R dn satisfy \\x\\ 2 Dl = vol(G). Then, 

vol(lb x (5)) 4 v (x) 



vol(G) -(S 2 A 2 (£ )' 



Proof. LemmaJS] guarantees the existence of a x £ Kg such that r a = x — o^i satisfies H^Hf^ < a 2 (£o) H^H-Pi ' 
Let us start by bounding a^; by the triangle inequality, 

(l- ax ) 2 vol(G) = (\\x\\ Dl -a x \\x\\ Dl f < \\r x f Di < J^vol(G), 

which implies (1 — a x ) 2 < j^nh ■ 



If i £ I6 X (5) then - 1| > <5, which implies ||(r T ),|| = ||.x',|| - a x \ > |||:r 4 || - 1| - |1 - a x \ > 5- \J x V 2 (c ) 
Squaring both sides of the inequality and summing over all i £ lb x {5) gives, 



vol(G) > \\r x f Di > Y, *ll(^)i||l>vol(X6 fc )(5-W-^l , (6) 



A 2 (£ ) v ' " " ^ ~ ^ -"v*™" ~ V A 2(£o) 

•ti.O|, 



as long as 5 > ^ Xo(c ) ■ Let us separate in two cases: 



If | > J Xz(c a ) ' th en i using (j6|) we have, 



vol(I6 fe ) < rf(x) f s _ / rj(x) \ < ??(x) f $ _ &\ 2 _ 4 



vol(G) " A 2 (£ ) V V A 2(£o) / " A 2 (£ ) V 1) ^ H^o) 



If, on the other hand, % < \ ttt^t, then, since vol ,^' > < 1, 

' ' 2 — y A 2 (£o) vol(G) — ' 

vol(J6 fc ) < < 7j(a;) ^<5\,~ _ 4 77(a) 



vol(G) " " A 2 (£ ) \2J S 2 X 2 (C ) 



□ 



By having a bound on how large the set of Ill-balanced vertices can be (Lemma |8]) we can control how much 
r]{x) is affected when we locally normalize x. This statement is made precise in the following Lemma, which 
finalizes the proof of the Theorem. 

Lemma 9. For every x £ R dn , r](x) < x ^ Q ^ r](x). 

Proof. We want to bound r)(x) = 2 V oi(G) ^ij ^'j'II^ — PijXj\\ 2 - Without loss of generality we can assume 
Hxll^ = vol(G). Let < 7 < 1, then 

V ^ ~ 2vol(G) I ^2 W *iW i i ~ Pi3 £ j\\ 2 + Yj YjWiiWxi- PijXjf + Yj 



VJij \ \Xi Pij Xj I 



S vol(G) + 2vol(G) ^ lAl Pl]XA ' 
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By Lcmmalwc have 4«1 < If ^ 



Note that, for any y,z <E M d , 



v z 

Ml II --II 



^ min{L|f,|| Z ||} - B y Settin g 2/ = ^ and Z = Pij X j We S et 



II** — PiMW ^ Jn{\\x^\x)\\} - THiS im P li6S tHat 



1 \ ^ „„ ||- „ - ||2 / 1 ST f \\ x i-Pii x i\ 



2 V0 l(G).jC w * Pl ^ f ~ 2vol(G)..J?_^V 



~ 2vol(G) (l- 7 ) 2 ^11^ ^^ll 2 



This means that 



16 77(xi 
/16 1 1 \ , , 

Since A 2 (£o) < 1 (see, e.g., [6]), it is possible to pick 7 (e.g. 0.7) such that ^ \ 2 (c ) + (1-7)^ — a 3 (£ q ) • 
finalizes the proof. □ 

Lemma [9] immediately implies the bound for t\g in Theorem [5j In order to obtain the bound for one can 
simply note that, if using Lemma [3] for x, one has x l = x, for every * > 0. Thus C(i) < \lh(x). 
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3 The 0(d) Cheeger inequality 

We present now our main result, 

Theorem 10. Let Xi(Ci) and Xi(Co) denote the i-th smallest eigenvalue of, respectively, the normalized 
Connection Laplacian C\ and the normalized graph Laplacian Cq. Let §q and vq denote, respectively, the 
0(d) frustration t\ constant and the 0{d) frustration £2 constant of G. Then, 



d 



i=l 

and 

d 



1 



\ E X ^ ^ < 1026rf3 TT^T E 

For the S d_1 case, the main difficulty that we faced in trying to obtain, from eigenvectors, candidate solutions 
for the localization problem was the local unit norm constraint. This is due to the fact that the localization 
problem requires its solution to be a function from V to S d_1 , corresponding to a vector in M. dn whose vertex 
subvectors have unit norm, while the eigenvector, in general, does not satisfy such a constraint. Nevertheless, 
the results in the previous section show that, by simply rounding the eigenvector, one does not lose more 
than a linear term (if considering rjo), given that the graph has a spectral gap bounded away from zero. 

However, the 0(d) synchronization setting is harder. The reason being that, besides the local normalization 
constraint, there is also a local orthogonality constraint (at each vertex, the d vectors have to be orthogonal 
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so that they can be the columns of an orthogonal matrix). For § d_1 we locally normalized the vectors, by 
choosing for each vertex the unit vector closest to Xj. For 0(d) synchronization we will pick, for each vertex, 
the orthogonal matrix closest (in the Frobenius norm) to the matrix \x\ ■ ■ ■ xf\ . This can be achieved by the 
Polar decomposition. Given a d x d matrix X, the matrix U(X), solution of minj/eofd) \\U(X) — X\\p, is one 
of the components of the Polar decomposition of X (see [SJ EI] and references therein) but, slightly abusing 
notation, we will call it the Polar decomposition of X. We note that U(X) can be computed efficiently 
through the SVD decomposition of X. In fact, given the SVD decomposition of X, X = XJYA/ T , the polar 
decomposition of X is given by U(X) — UV T (see [5]). 

We will show the following Lemma 

Lemma 11. Given x 1 , . . . , x d £ M. dn such that (x k , x ) d 1 = for all k ^ I, consider the potential O : V —> 
0(d) given as Oi =U (Xi) where Xi = ^x\ ■ ■ ■ xf\ and U(X) is the Polar decomposition of X , the orthogonal 
matrix that is closest to X in the Frobenius norm. If Xi is singula^ U(Xi) is simply set to be Id. Then, 

,(0)<(2d^ + 2^)-^—J2^ xl )- 
We need the following result of Li [10] . 

Lemma 12 (Theorem 1 in [10]). Let A,B € C dxd be non-singular matrices with polar decompositions 
A = U(A)P and B = U(B)P'. Then \\U(A) - U{B)\\ F < ( a)+<t ■ (B) W A ~ B \\f' where <?™i"( A ) is the 
smallest singular value of the matrix A. 

In order to be able to bound from below the smallest singular value of X = \x\ ■ ■ ■ x d ] we will introduce a 
notion similar to Xb x (6) but designed to take into account local orthogonality instead of local normalization. 

Definition 13. Given x,y € K dra two D\-orthogonal vectors, normalized so that \\x\ljj = = vol(G), 

and a positive number S, we defined the Ill-balanced vertex subset of the graph G as 

lb xy (5) = {i€V:\{xi, yi )\>5}. 
The following Lemma illustrates the purpose of this definition. 

Lemma 14. Let x 1 ,...,x d € ffi dn be D\-orthogonal vectors, normalized so that \\x k \\j :) = vol(G). Let 

us define the "balanced" set B as the complement o/Ufee[d] {^°x k (j^) ^Um6[d]\(t)^i'i'" (lb))- ^ or °" 
i,j € B, we have 

\\U(Xi) - Pl3 U(X 3 )\\ F < V2\\X t - PijXjWp. 

Proof. For i <E B, consider the gram matrix Xj Xi. Its fc-th diagonal entry satisfies ||2;^|| 2 > (l — ^j) 2 > 
1 — On the other hand the non-diagonal entries are, in magnitude, smaller or equal to ^j. By 
the Gershgorin circle theorem, the smallest eigenvalue of XfXi, which is equal to <7 m i n (X;) 2 , satisfies 
CT m i„(Xi) 2 > 1- 33 - (d- 1)^. Hence, tj min (X i ) > By observing that U{p. lJ X j ) = p ij U{X j ), and 

using Lemma IT2I we get ||t/(Xj) — pijU(Xj)\\p < V2\\Xi — pijXj\\p. □ 
Now we want to bound the size of lb xy (S). 

Lemma 15. Let x,y e M. dn be D\-orthogonal vectors such that \\x\\'j :)i = \\y\\'j) 1 = vol(G). Then, 

voi(ib xy (±)\(ib x (±)uib y (±))) M + v(y) 

vol( G) " m) A 2 (£ ) ' 

5 In this case the uniqueness of U(Xi) is not guaranteed and thus the map is not well-defined. 
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Proof. Let us consider the vector u = -^(x + y). It satisfies HitHf^ = vol(G) and by the triangle inequality 

on the norm || • || Ll , r)(u) < rj(x) + rj(y). By Lemma|we get ^^if^ < 4(8d) 2 ^^nffi ■ We conclude the 
proof by noting that lb xy (^) C lb x (±){Jlb y (±)uTb u (^). In fact, \ii<£lb x (±) Ulb y (^)uZ6 u (^) 
than|(x i , B >|= ||mf - ''"»'+»»»' <(l + ^) 2 -(l-i) 2 = i. □ 



Proof, [of Lemma 111] 

Let us consider £> as defined in Lemma [HI meaning S c = Ufce[d] (z°x k (^j) U Ume[rf]\{/c} ^x k x m (53) 
want to bound i/(0) = Mv * (g) £ 4j ||0 4 - PijOjfp. Since (6 x £) c C (B c x V) U (V x B c ), 

E - pv°j\\ 2 F < E w ij\\ u ( x i) - piMXj)\\ 2 F + 2 E E w v\\°i - hjOjWf 

ij (ij)eBxB iee<= jev 

< 2^m iJ ||X J - pyXjUl, + 8rfvol(6 c ) , 



We 



where the second inequality was obtained by using Lemma [14] and noting that Oi is an orthogonal matrix. 



To bound vol(2? c ) we make use of Lemmas 151 and [T5l and get that 



E 



vol(G) 

vol (lb xk (£)) 1 vol (Xb x , xm (£) \ (&) (£))) o8 ^ 3 E fce[(i ] 



vol(G) 



vol(G) 



is bounded above by 
< 2 s d 



A 2 (£o) 



fce[d] fcG[d] mG[d]\fe 

Since - PijXj\\ 2 F = 2 vol(G) ELi »?(**)> ™ get z/(0) < (2d- 1 + 2 10 d 3 ) ^ 5^ ^j. □ 



Proof, [of Theorem [10] 

The lower bound for vq is a simple consequence of the fact that the eigenvector problem is a relaxation of 
the Synchronization one and the upper bound is a direct consequence of Lemma 1111 

The lower and upper bounds for 'Oq can be obtained by comparing §q with vq{0). It is easy to check that 
$g(0) > vg{0), which gives the lower bound. 

On the other hand, denoting the fc-th column of Oi by (Oi) k , we have 

1 



MO) 



Vdvol(G) 



E^ 



\ k=l 



- ^E^g)E^II(°^-^(^), 

v k=l v ' ij 



k=l 



^ 2dvol(G) ^ 



EE^' U°i)-k-pv (°j)-k\ 



(7) 

(8) 



where JjJ) is obtained using Lemma [3] (via the same observation as was made in the proof of Theorem [5]) 
and ((HJ) is a direct consequence of Cauchy Schwartz inequality. This gives the upper bound for da and thus 
completes the proof. □ 
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4 Spectral 0(d) Synchronization Algorithm 



The proof of Theorem [TU1 suggests the following algorithm to solve 0(d) Synchronization. 

Algorithm 16. Given a weighted graph G = (V, E) and a function p : E — > 0(d), build the graph Connection 
Laplacian L\ and the degree matrix D%. Compute x l ,...,x as the first d (Di- orthogonal) minimizers of 
frj^f (that correspond to an eigenvector problem). Then, for each vertex i, compute the polar decomposition 
of the d x d matrix at that vertex, obtaining the orthogonal matrix Oi. 

The proof of Theorem [TU] guarantees that, if the graph G is sufficiently connected and vq is small, then 
Algorithm \W\ will compute a good solution. 

We note that an SO(3) version of this Algorithm was proposed in [T3] (without performance guarantees) 
and also used in [18]. Also, a very similar SO(2) version was introduced in QT|. In [IT] the resilience to a 
certain random noise model is analyzed. 

The performance guarantee for this algorithm relies on the fact that the vectors obtained are £>i-orthogonal. 
However, in practice, due to possible errors in the calculations this condition might be perturbed and the 
D\ inner products of these vectors, although small, may no longer be exactly zero. It is easy to adapt the 
analysis to this setting and show that it is in fact robust to such perturbations. 

The results in this paper also suggest an alternative to Algorithm [16] which corresponds to, instead of solving 
the eigenvector problem, determine the d vectors sequentially and, at each step, constrain on the vector 
being locally orthogonal to the previous ones (this can still be done efficiently, see [8]). After the d vectors 
are obtained one can simply locally normalize each one and output that as the Synchronization solution 
candidate. The issue with this method is that its iterative nature makes its analysis more difficult, as it 
is hard to guarantee that small errors in the first few vectors would not greatly affect the remaining ones. 
Also, numerical simulations suggest that the performance, in practice, of both methods is roughly the same. 

5 Tightness of results 

Let us consider the ring graph on n vertices G n = (V n ,E n ) with V n — [n] and E = {(i, (i + l)mod n), i £ [n]} 
with the edge weights all equal to 1 and p : V — > 0(d) as P( n ,i) — I arL d p = I for all other edges. Define 

x G R dn by x k = [2 1 - 1, 0, . . . , 0] T . It is easy to check that r](x) = C(n" 2 ) and that, for any t > 0, if 
x l ^ 0, there will have to be at least one edge that is not compatible with x 1 , implying £(x*) > and 
ij( xt ) > This shows that the 1/2 exponent in Lemma[3]is needed. In fact, by adding a few more edges 
to the graph G n one can also show the tightness of Theorem [2] Consider the "rainbow" graph H n that is 
constructed by adding to G„, for each non- negative integer k smaller than n/2, an edge between vertex k 
and vertex n — k with P(k,n-k) — ~ I- The vector x still satisfies r/(x) = 0(n~ 2 ), however, for any non-zero 
vector v : V — > S d_1 U {0}, it is not hard to show that both ((v) and 77(17) have to be of order at least n" 1 , 
meaning that both and rj* G are £l(^/\i(L\)). This also means that, even if considering 77^ , one could not 
get a linear bound (like Lemma [5] provides) without the control on A2(£o)- 

Theorem 1101 provides a non-trivial bound only if A2(£o) is sufficiently large. It is clear that if one wants 
to bound full frustration constants, a dependency on \2(Cq) is needed. It is, nevertheless, non-obvious 
that even if considering partial versions of O(d) frustration constants, i)q or Vq, the dependency on A2(£o) 
is still required. This can, however, be illustrated by a simple example in 0(2); consider a disconnected 
graph G with two sufficiently large complete components, G 1 — (V 1 ^ 1 ) and G 2 = (V 2 ,E 2 ). For each 
edge let pij ~ [ ? ] - It is clear that the vectors x 1 and x 2 defined such that x\ — [0, lyi(i)] T and 
x 2 = [0, 1^2(1)] are orthogonal to each other and lie in the null space of the graph Connection Laplacian 

6 The fact that the calculation of one of the vectors greatly depends on the ones already computed. 
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of G. This implies that A2OC1) = 0. On the other hand, it is straightforward to check that neither "d* G or v* G 
are zero because it is impossible to perfectly synchronize the graph (or even any of the components). 

6 Concluding Remarks 

Synchronization is a challenging problem. Recent discoveries [T2] suggest that spectral methods are very 
promising as a feasible method to solve this problem. In fact, in probability guarantees of performance 
are given for the performance of a spectral method to solve the £0(2) synchronization problem under a 
certain random noise model. Nevertheless, to the best of our knowledge, Algorithm [16] is the first method for 
0(d) synchronization having a (deterministic) worst case performance guarantee. As one would expect, the 
worst case performance is significantly weaker then the kind of probabilistic guarantees, given a specific noise 
model, e.g. as the one given in |llj . In fact, the guarantee in |llj is given in terms of distance between the 
candidate solution and the ground truth, while the one on Algorithm[l6]is given in terms of the compatibility 
error. 

In special applications one knows, a priori, that every element in the potential has positive determinant. 
This corresponds to a synchronization problem in SO(d). Although this can be viewed as a special case of 
the 0(d) problem it is expected that the additional structure can be leveraged to improve the algorithm (and 
the analysis). In particular, the first d — 1 columns of a matrix in SO(d) completely determine the matrix. 
This suggests that the SO(d) synchronization problem is solvable by just the first d — 1 eigenvectors of the 
Connection Laplacian, instead of the d first ones. In fact, the SO(2) Synchronization problem is equivalent 
to the S 1 localization one, and the guarantees for the S 1 localization problem were given solely in terms of 
the first eigenvalue of the graph Connection Laplacian. We leave the improved SO(d) analysis for future 
work. 

One might argue that, in some applications, the weights on the edges of the graph do not have a clear 
meaning. The reason being that we may be given a few relative measurements pij and it is unclear how to 
give weights to such measurements. In such cases, since we have the freedom of choosing the weights of the 
edges and Theorem [TBI suggests that our method will work better with a large A2(£o): onc could compute 
the weights of the edges in a way such that A2(£o) is maximized. This problem is solved in |16j . The caveat 
is that, the new weights will affect the way the compatibility error is measured and the eigenvalues of the 
Connection Laplacian. It is thus still unclear if such an approach would improve the method. Another 
interesting possible outcome of a procedure of this nature is a possible ranking of the edges, large weights 
would likely tend to be given to edges that are more important to ensure the connectivity of the graph. 

The classical Cheeger Inequality has an analogous result in smooth manifolds (actually, the first to be 
shown [4]). One interesting question is whether the Theorems in this paper have an analogous smooth 
version. Onc difficulty is to understand what would correspond to the frustration constant on the smooth 
case. Vector Diffusion Maps [13] suggest that an analogous result in smooth manifold would be related to 
the parallel transport and its incompatibility and some results in Differential Geometry [3] suggest that the 
Holonomy could be a geometric property that corresponds to the frustration constant. These suggestions are 
"coherent" because Holonomy can, in some sense, be viewed as the incompatibility of the Parallel transport 
(due to the curvature of the manifold). 
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